Data Introductory

Introduction with data

Importing data set

'data.frame':   15190 obs. of  37 variables:
 $ imdbRating      : chr  "8.4" "8.3" "8.4" "8.3" ...
 $ ratingCount     : chr  "40550" "45319" "81007" "37521" ...
 $ duration        : chr  "3240" "5700" "9180" "6420" ...
 $ nrOfWins        : chr  "1" "2" "3" "1" ...
 $ nrOfNominations : chr  "0" "1" "4" "1" ...
 $ nrOfPhotos      : chr  "19" "35" "67" "53" ...
 $ nrOfNewsArticles: chr  "96" "110" "428" "123" ...
 $ nrOfUserReviews : int  85 122 376 219 186 254 211 180 653 226 ...
 $ nrOfGenre       : int  3 3 2 3 3 3 2 2 3 1 ...
 $ Action          : int  0 0 0 1 0 0 0 0 0 0 ...
 $ Adult           : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Adventure       : int  0 1 0 1 0 0 0 0 0 0 ...
 $ Animation       : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Biography       : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Comedy          : int  1 1 0 1 1 0 1 1 0 0 ...
 $ Crime           : int  0 0 0 0 0 1 0 0 0 0 ...
 $ Documentary     : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Drama           : int  1 0 1 0 1 1 0 1 1 1 ...
 $ Family          : int  1 1 0 0 0 0 0 0 0 0 ...
 $ Fantasy         : int  0 0 0 0 0 0 0 0 0 0 ...
 $ FilmNoir        : int  0 0 0 0 0 0 0 0 0 0 ...
 $ GameShow        : int  0 0 0 0 0 0 0 0 0 0 ...
 $ History         : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Horror          : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Music           : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Musical         : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Mystery         : int  0 0 0 0 0 0 0 0 0 0 ...
 $ News            : int  0 0 0 0 0 0 0 0 0 0 ...
 $ RealityTV       : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Romance         : int  0 0 0 0 1 0 1 0 1 0 ...
 $ SciFi           : int  0 0 1 0 0 0 0 0 0 0 ...
 $ Short           : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Sport           : int  0 0 0 0 0 0 0 0 0 0 ...
 $ TalkShow        : int  0 0 0 0 0 0 0 0 0 0 ...
 $ Thriller        : int  0 0 0 0 0 1 0 0 0 0 ...
 $ War             : int  0 0 0 0 0 0 0 0 1 0 ...
 $ Western         : int  0 0 0 0 0 0 0 0 0 0 ...

From first, I have taken this data set from kaggle, as poorly formatted data set. Fortunately most of the data set were well sorted according to table. But still there were observations which were not rightly formatted, so we had to lose all of them.

source: kaggle website for dataset.

Summary

Summary View (Numeric)

   imdbRating     ratingCount           duration         nrOfWins       
 Min.   :1.000   Min.   :      2.4   Min.   :     2   Min.   :   0.000  
 1st Qu.:6.300   1st Qu.:    492.0   1st Qu.:  3600   1st Qu.:   0.000  
 Median :7.000   Median :   3642.0   Median :  5700   Median :   0.000  
 Mean   :6.864   Mean   :  25890.9   Mean   :  5799   Mean   :   9.918  
 3rd Qu.:7.600   3rd Qu.:  19885.0   3rd Qu.:  6660   3rd Qu.:   2.000  
 Max.   :9.900   Max.   :1183395.0   Max.   :379114   Max.   :7620.000  
 NA's   :1608    NA's   :1248        NA's   :1039     NA's   :388       
 nrOfNominations      nrOfPhotos     nrOfNewsArticles  nrOfUserReviews 
 Min.   :   0.000   Min.   :   0.0   Min.   :    0.0   Min.   :   0.0  
 1st Qu.:   0.000   1st Qu.:   0.0   1st Qu.:    0.0   1st Qu.:   3.0  
 Median :   0.000   Median :   6.0   Median :    8.0   Median :  29.0  
 Mean   :   5.489   Mean   :  23.3   Mean   :  245.7   Mean   : 103.6  
 3rd Qu.:   3.000   3rd Qu.:  25.0   3rd Qu.:   96.0   3rd Qu.: 102.0  
 Max.   :2011.000   Max.   :2810.0   Max.   :32345.0   Max.   :4928.0  
 NA's   :34         NA's   :7        NA's   :1                         

Dependent Exploration


I will present graphs for all the variables individually in order to understand about it’s behavior.

Note:

dependent variable is asymmetric, rightly skewed.

Independant Numeric


Note:

scaling some of the independent variable

would be good idea to implement in order

to get control on sensitivity or have clear

picture about the happening.

Categorical Graphs

Correlation

\(Correlation-Matrix\)


As described in our

correlation matrix darker

color shows the correlation

among explanatory variables.

For instance variable

ratingCount and

nrOfUserReviews highly cor

related with each other.

Hence one can be removed

from modeling.

---
title: "IMDB Prediction"
output: 
  flexdashboard::flex_dashboard:
    orientation: column
    
    social: ["facebook","twitter"]
    theme: yeti
    source_code: embed
---

```{r setup, include=FALSE,cache=T}
library(flexdashboard)
suppressPackageStartupMessages( library(dplyr))
suppressPackageStartupMessages(library(ggplot2))
suppressPackageStartupMessages(library(plotly))
library(caret)
```

Data Introductory 
====================================

Introduction with data {data-width=400}
----------------------------------

### Importing data set
```{r import part}
imdb <- read.csv("imdb.csv") #import of the data file in R.

rating_pred <- imdb[,-c(1:5,9,10)] #removing redundant

str(rating_pred) #checking structure, since our data is a from kaggle. so this is highly                      #unformulated
rating_pred <- na.omit(rating_pred) #removing NAs


## Recoding rating_pred$imdbRating into rating_pred$imdbRating
rating_pred$imdbRating <- as.character(rating_pred$imdbRating)
rating_pred$imdbRating[rating_pred$imdbRating == ""] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " and Gays (TV Episode 2004)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " grAs - Die Serie (TV Series 2000â\200 )"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " hat die Wahl (2000)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Mary (TV Episode 1998)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Paranormal Activity\\"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Sons (TV Episode 2006)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Spion (2011)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Winter... und Frühling (2003)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "achtung fertig charlie"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "alamo der traum das schicksal die legende"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "bergman och filmen bergman och teatern bergman och f r tv movie"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "brows held high shakespeare film and kenneth branagh a retrospective tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "celebrity sch n reich ber hmt"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "cin mas d horreur apocalypse virus zombies"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "coupling wer mit wem perhaps perhaps perhaps tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "crazy stupid love"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "deine meine unsere"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "die mommie die"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "dr seltsam oder wie ich lernte die bombe zu lieben"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "easy riders raging bulls how the sex drugs and rock n roll generation saved hollywood"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "ed edd n eddy tv series"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "eins zwei drei"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "einsam zweisam dreisam"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "estrenos cr ticos mientras duermes contagio sin salida tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "genre mill memories wandering butterflies turkish cats karate warriors and me video"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0009932/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0011237/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0012522/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0013427/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0013442/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0014636/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0015074/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0018051/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0023551/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0024601/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0025452/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0027532/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0029335/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0031385/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0033408/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0033477/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0033563/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0034299/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0034862/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0035795/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0036008/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0036172/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0037101/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0037824/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0037931/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0038059/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0038182/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0038890/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0040580/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0040928/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0041085/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0041113/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0043456/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0043461/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0044084/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0044509/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0044916/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0045125/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0045554/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0045963/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0047562/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0047977/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0048183/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0048545/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0049470/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0049471/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0049762/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0050095/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0052646/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0052893/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0053084/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0053363/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0054390/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0054518/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0054528/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0054642/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0055093/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0055233/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0056173/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0057171/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0057191/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0057193/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0057547/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0058265/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0058437/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0058536/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0059742/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0059749/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0059903/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060009/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060143/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060304/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060550/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060556/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060955/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061170/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061176/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061610/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061735/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061791/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0062082/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0062467/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0063152/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0063661/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0063925/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0063950/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0064418/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0065610/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0065797/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066049/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066108/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066364/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066612/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066730/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0067402/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0068182/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0068240/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0068286/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0068555/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0069097/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0069495/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0069547/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0069824/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0071411/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0071555/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0072052/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0072973/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0073341/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0074006/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0074851/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0075007/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0075132/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0075323/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076070/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076155/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076451/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076538/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076752/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0077975/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0078841/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0080031/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0080129/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0080388/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0080761/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0081353/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0082198/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0082418/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0083715/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0083745/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0084938/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0085933/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0086250/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0086837/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0087365/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0087884/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0089670/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0089907/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0090633/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0090685/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0090852/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0091080/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0091149/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0091209/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0091214/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0092593/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0093105/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0093164/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0093543/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0094641/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0094988/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0094991/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0095595/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0096061/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0097289/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0097523/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0097757/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0097958/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0098360/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0098724/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0098749/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0099850/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0100050/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0100928/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0101120/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0101329/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0101393/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0102027/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0103927/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0104437/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0104974/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0106168/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0106332/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0107209/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0108174/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0108828/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0108927/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0109771/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0110478/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0111579/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0111942/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0112346/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0114108/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0116059/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0116493/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0117284/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0117786/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0117958/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118004/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118655/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118829/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118843/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118925/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119207/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119229/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119345/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119432/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119465/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119484/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0120789/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0120834/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0120868/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0125061/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0129884/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0130671/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0139735/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0141163/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0147800/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0162360/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0163187/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0164877/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0168590/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0169858/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0179473/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0183505/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0190590/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0192335/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0197521/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0205461/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0212338/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0234853/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0240684/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0242423/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0247144/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0263975/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0264235/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0265713/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0273855/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0276345/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0290002/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0294097/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0306274/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0317836/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0319061/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0323587/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0337921/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0338261/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0343818/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0345561/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0360201/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0362359/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0364517/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0365285/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0365376/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0367110/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0367478/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0373760/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0378072/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0378284/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0381348/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0383028/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0393735/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0394587/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0401711/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0416046/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0418769/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0421054/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0429589/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0432637/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0433028/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0433383/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0433416/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0433771/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0442268/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0443295/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0443649/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0449514/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0459293/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0461412/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0462501/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0464913/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0466399/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0466642/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0483703/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0485513/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0494716/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0499516/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0504240/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0520589/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0539476/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0550527/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0566900/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0588221/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0595951/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0609115/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0684837/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0699676/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0708502/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0745906/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0748792/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0758774/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0760329/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0770772/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0775349/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0784159/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0800950/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0808399/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0808506/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0815236/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0815353/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0818276/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0819379/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0820911/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0832266/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0836148/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0862856/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0864311/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0874827/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0879221/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0968294/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0969647/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0970416/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0970866/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0996966/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1017771/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1032815/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1032846/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1067733/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1092633/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1134859/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1137936/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1156312/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1185371/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1189073/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1200078/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1234548/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1237375/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1286537/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1294574/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1303900/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1329457/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1334260/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1336006/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1337117/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1338542/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1353866/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1381010/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1414501/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1421046/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1430966/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1456941/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1488565/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1496905/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1549449/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1586265/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1588334/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1590024/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1601895/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1608777/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1635614/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1649419/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1679204/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1718837/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1740712/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1741225/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1742336/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1814187/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1816777/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1833919/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1860353/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1894193/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1922777/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1923214/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1926567/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1930748/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1934915/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1936736/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1965492/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1981825/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2013841/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2040639/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2059151/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2086872/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2161445/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2175739/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2180851/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2203975/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2205697/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2313197/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2385639/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2396767/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2655706/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2669622/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2739384/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2761156/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt3198848/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt3359268/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt3567828/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "jeanne dielman quai du commerce bruxelles"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "johnny guitar gejagt geha t gef rchtet"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "kill daddy kill"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "liebe l ge leidenschaft tv series"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "love wedding marriage ein plan zum verlieben"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "man taraneh panzdah sal daram"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "mary max oder schrumpfen schafe wenn es regnet"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "micky donald goofy die drei musketiere video"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "o vertrauen verf hrung verrat"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "pepi luci bom und der rest der bande"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "phineas und ferb run candace run last train to bustville tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "psych lock stock some smoking barrels and burton guster s goblet of fire tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "ready steady cook tv series"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "sprung auf marsch marsch"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "to wong foo thanks for everything julie newmar"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "tomorrow yesterday and today video"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "weiblich ledig jung sucht"] <- NA


```
*** 
From first, I have taken this data set from kaggle, as poorly formatted data set. Fortunately most of the data set were well sorted according to table. But still there were observations which were not rightly formatted, so we had to lose all of them. 

> source: kaggle website for dataset.

Summary
---------------------------------- 

### Summary View (Numeric)
```{r summary part}
numeric <- as.data.frame(apply(rating_pred[,c(1:8)], 2, as.numeric)) 
           #converting variables into numeric and excluding NAs
summary(numeric)

cate <- as.data.frame(apply(rating_pred[,c(9:37)], 2,as.factor))## categorical variables in                       ### data set

imdb_rating <- as.data.frame(cbind(numeric,cate))%>%
                     na.omit() #combinning both numeric and categorical variables
```



Dependent Exploration {data-navmenu="Graph"}
===========================================

***
I will present graphs for all the variables individually in order to understand about it's behavior.

`Note:`

dependent variable is asymmetric, rightly skewed.

```{r,cache=TRUE}

p <- imdb_rating %>% na.omit() %>% 
  ggplot(aes(imdbRating))+
  geom_bar(aes(col="red") )+ylab("Count")+xlab("IMDB-Rating")+
  ggthemes::theme_tufte()+
  geom_vline(xintercept = c(4.5,9)) 
ggplotly(p)
```

Independant Numeric {data-navmenu="Graph"}
=========================================
*** 

`Note:`

scaling some of the independent variable

would be good idea to implement in order

to get control on sensitivity or have clear  

picture about the happening.
```{r,cache=TRUE}

p1 <- ggplot(imdb_rating, aes(nrOfNominations,imdbRating))+
  geom_point()+
  geom_jitter(aes(.7))+
  ylab("")
# ggplotly(p1)
 

p2 <- imdb_rating%>%
  ggplot(aes(duration,imdbRating))+
  geom_point()+
  geom_jitter(aes(.7))+
  ylab("")
#ggplotly(p2)

p3 <- imdb_rating%>%
  ggplot(aes(nrOfWins,imdbRating))+
  geom_point()+
  geom_jitter(aes(.7))+
  ylab("")
#ggplotly(p3)

p4 <- imdb_rating%>%
  ggplot(aes(nrOfNewsArticles,imdbRating))+
  geom_point()+
  geom_jitter(aes(.9)) +
  ylab("")
#ggplotly(p4)

p5 <- imdb_rating %>% 
  ggplot(aes(ratingCount,imdbRating))+
  geom_point()+
  geom_jitter(aes(.7))+ylab("")

p6 <- imdb_rating %>% 
  ggplot(aes(nrOfUserReviews,imdbRating))+
  geom_point()+
  geom_jitter(aes(.7))+
  ylab("")
p7 <- imdb_rating %>% 
  ggplot(aes(nrOfPhotos,imdbRating))+
  geom_point()+
  geom_jitter(aes(.7))+
  ylab("")


gridExtra::grid.arrange(p1,p2,p3,p4,p5,p6,p7,nrow=4)
```


Categorical Graphs {data-navmenu="Graph"}
==================================================

```{r,cache=TRUE}

attach(imdb_rating)

bplot <- function(feature,inde){
  p <- ggplot(imdb_rating,aes(feature,imdbRating))+
    geom_boxplot(outlier.colour = "red",outlier.shape = 8)+
    ggthemes::theme_base()+
     theme(
          axis.line.y = element_blank(),
          #axis.text.y = element_blank(),
          axis.title.y = element_blank(),
          axis.ticks = element_blank()
          )+
    #ggtitle("Box-Plot(IMDB~Feature)")+
    #ylab("IMDB")+
    xlab(inde)
  return(p)
  
}

par(mfrow=c(6,7))
bplot(Thriller,"Thriller")
bplot(nrOfGenre,"No of Genre")
bplot(Action,"Action")
bplot(Adult,"Adult")
bplot(Adventure,"Adventure")
bplot(Animation,"Animation")
bplot(Biography,"Biography")
bplot(Comedy,"Comedy")
bplot(Crime,"Crime")
bplot(Documentary,"Documentary")
bplot(Drama,"Drama")
bplot(Family,"Family")
bplot(Fantasy,"Fantasy")
bplot(FilmNoir,"Filmnoir")
bplot(GameShow,"Gameshow")
bplot(History,"History")
bplot(Horror,"Horror")
bplot(Music,"Music")
bplot(Musical,"Musical")
bplot(Mystery,"Mystery")
bplot(News,"News")
bplot(RealityTV,"RealityTV")
bplot(Romance,"Romance")
bplot(SciFi,"Sci-fi")
bplot(Short,"Short")
bplot(Sport,"Sport")
bplot(TalkShow,"Talkshow")
bplot(War,"War")
bplot(Western,"Western")
```

Correlation {data-icon="fa-pencil"}
===================================================

$Correlation-Matrix$


***

As  described   in      our

correlation  matrix  darker

color shows the correlation

among explanatory variables.

For   instance     variable

  `ratingCount`     and 

`nrOfUserReviews` highly cor

related with  each    other.

Hence  one can  be  removed 

from  modeling.

```{r}
numeric %>%
  na.omit() %>%
  cor() %>% 
  corrplot::corrplot(type = "upper")
```